NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Progressive and Scalable Hotspot Detection through Local K-Function in Spatial Networks

https://doi.org/10.1145/3766549

Kang, Yunfan; Liu, Yongyi; Juvekar, Pratham; Mahmood, Ahmed; Wang, Shaowen; Magdy, Amr (September 2025, ACM Transactions on Spatial Algorithms and Systems)

The widespread availability of geotagged data combined with modern map services allows for the accurate attachment of data to spatial networks. Applying statistical analysis, such as hotspot detection, over spatial networks is very important for precise quantification and patterns analysis, which empowers effective decision-making in various important applications. Existing hotspot detection algorithms on spatial networks either lack sufficient statistical evidence on detected hotspots, such as clustering, or they provide statistical evidence at a prohibitive computational overhead. In this paper, we propose efficient algorithms for detecting hotspots based on the network local K-function for predefined and unknown hotspot radii. The K-function is a widely adopted statistical approach for network pattern analysis that enables the understanding of the density and distribution of activities and events happening within the spatial network. However, its practical application has been limited due to the inefficiency of state-of-the-art algorithms, particularly for large-sized networks. Extensive experimental evaluation using real and synthetic datasets shows that our algorithms are up to 28 times faster than the state-of-the-art algorithms in computing hotspots with a predefined radius and up to more than four orders of magnitude faster in identifying hotspots without a predefined radius. Additionally, to address dynamic changes in the spatial network, we propose an incremental hotspot detection approach that efficiently updates hotspot computations by leveraging prior results as new events are added.
more » « less
Free, publicly-accessible full text available September 11, 2026
Pyneapple-L: Scalable Expressive Learning-based Spatial Analysis

https://doi.org/10.1145/3678717.3691228

Liu, Yongyi; Lee, Nicolas; Kang, Yunfan; Shahneh, Mohammad Reza; Mahmood, Ahmed; Chinnam, Vishal Rohith; Sarawadekar, Aparna Vivek; Oymak, Samet; Sabek, Ibrahim; Magdy, Amr (October 2024, ACM)

Full Text Available
Scalable Spatio-temporal Top-k Interaction Queries on Dynamic Communities

https://doi.org/10.1145/3648374

Almaslukh, Abdulaziz; Liu, Yongyi; Magdy, Amr (March 2024, ACM Transactions on Spatial Algorithms and Systems)

Social media platforms generate massive amounts of data that reveal valuable insights about users and communities at large. Existing techniques have not fully exploited such data to help practitioners perform a deep analysis of large online communities. Lack of scalability hinders analyzing communities of large sizes and requires tremendous system resources and unacceptable runtime. This article proposes a new analytical query that identifies the top-kposts that a given user community has interacted with during a specific time interval and within a spatial range. We propose a novel indexing framework that captures the interactions of users and communities to provide a low query latency. Moreover, we propose exact and approximate algorithms to process the query efficiently and utilize the index content to prune the search space. The extensive experimental evaluation on real data has shown the superiority of our techniques and their scalability to support large online communities.
more » « less
Full Text Available
Pyneapple-R: Scalable and Expressive Spatial Regionalization

https://doi.org/10.1109/ICDE60146.2024.00437

Kang, Yunfan; Liu, Yongyi; Alrashid, Hussah; Bilgi, Akash; Purohit, Siddhant; Mahmood, Ahmed; Rey, Sergio; Magdy, Amr (May 2024, IEEE)

Full Text Available
Scalable Evaluation of Local K-Function for Radius-Accurate Hotspot Detection in Spatial Networks

https://doi.org/10.1145/3589132.3625646

Liu, Yongyi; Kang, Yunfan; Mahmood, Ahmed; Magdy, Amr (November 2023, The 31st ACM International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL '23))

The widespread of geotagged data combined with modern map services allows for the accurate attachment of data to spatial networks. Applying statistical analysis, such as hotspot detection, over spatial networks is very important for precise quantification and patterns analysis, which empowers effective decision-making in various important applications. Existing hotspot detection algorithms on spatial networks either lack statistical evidence on detected hotspots, such as clustering, or they provide statistical evidence at a prohibitive computational overhead. In this paper, we propose efficient algorithms for detecting hotspots based on the network local K-function for predefined and unknown hotspot radii. The network local K-function is a widely adopted statistical approach for network pattern analysis that enables the understanding of the density and distribution of activities and events in the spatial network. However, its practical application has been limited due to the inefficiency of existing algorithms, particularly for large-sized networks. Extensive experimental evaluation using real and synthetic datasets shows that our algorithms are up to 28 times faster than the state-of-the-art algorithms in computing hotspots with a predefined radius and up to more than four orders of magnitude faster in identifying hotspots without a predefined radius.
more » « less
Full Text Available
PAGE: Parallel Scalable Regionalization Framework

https://doi.org/10.1145/3611011

Alrashid, Hussah; Liu, Yongyi; Magdy, Amr (September 2023, ACM Transactions on Spatial Algorithms and Systems)

Regionalization techniques group spatial areas into a set of homogeneous regions to analyze and draw conclusions about spatial phenomena. A recent regionalization problem, called MP-regions, groups spatial areas to produce a maximum number of regions by enforcing a user-defined constraint at the regional level. The MP-regions problem is NP-hard. Existing approximate algorithms for MP-regions do not scale for large datasets due to their high computational cost and inherently centralized approaches to process data. This article introduces a parallel scalable regionalization framework (PAGE) to support MP-regions on large datasets. The proposed framework works in two stages. The first stage finds an initial solution through randomized search, and the second stage improves this solution through efficient heuristic search. To build an initial solution efficiently, we extend traditional spatial partitioning techniques to enable parallelized region building without violating the spatial constraints. Furthermore, we optimize the region building efficiency and quality by tuning the randomized area selection to trade off runtime with region homogeneity. The experimental evaluation shows the superiority of our framework to support an order of magnitude larger datasets efficiently compared to the state-of-the-art techniques while producing high-quality solutions.
more » « less
Full Text Available
U-ASK: a unified architecture for kNN spatial-keyword queries supporting negative keyword predicates

https://doi.org/10.1145/3557915.3560975

Liu, Yongyi; Magdy, Amr (November 2022, The International Conference on Advances in Geographic Information Systems)

Full Text Available
SMP: Scalable Max-P Regionalization

https://doi.org/10.1145/3557915.3561011

Alrashid, Hussah; Liu, Yongyi; Magdy, Amr (January 2022, ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL))

Full Text Available
PRUC: P-regions with user-defined constraint

https://doi.org/10.14778/3494124.3494133

Liu, Yongyi; Mahmood, Ahmed R.; Magdy, Amr; Rey, Sergio (November 2021, Proceedings of the VLDB Endowment)

This paper introduces a generalized spatial regionalization problem, namely, PRUC ( P -Regions with User-defined Constraint) that partitions spatial areas into homogeneous regions. PRUC accounts for user-defined constraints imposed over aggregate region properties. We show that PRUC is an NP-Hard problem. To solve PRUC, we introduce GSLO (Global Search with Local Optimization), a parallel stochastic regionalization algorithm. GSLO is composed of two phases: (1) Global Search that initially partitions areas into regions that satisfy a user-defined constraint, and (2) Local Optimization that further improves the quality of the partitioning with respect to intra-region similarity. We conduct an extensive experimental study using real datasets to evaluate the performance of GSLO. Experimental results show that GSLO is up to 100× faster than the state-of-the-art algorithms. GSLO provides partitioning that is up to 6× better with respect to intra-region similarity. Furthermore, GSLO is able to handle 4× larger datasets than the state-of-the-art algorithms.
more » « less
Full Text Available

Search for: All records